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Abstract.  This  paper  presents  results  from  an  image  compression  scheme  based  on  iterated  ttansforms.  Results  are  examined 
as  a  function  of  several  encoding  parameters  including  maximum  allowed  scale  factor,  number  of  domains,  resolution  of  scale 
and  offset  values,  minimum  range  size,  and  target  fidelity.  The  performance  of  the  algorithm,  evaluated  by  means  of  fidelity 
versus  the  amount  of  compression,  is  compared  with  an  adaptive  discrete  cosine  transform  image  compression  method  over  a 
wide  range  of  compressions. 


/.usammrnfassung .  Diese  Arbeit  bringt  Resultate  fur  ein  Bildkompressionsverfahren.  basicrt  auf  iterativen  Transformationen. 
Die  Ergebnisse  werden  in  Hinblick  auf  mehrere  Codierungsparameter  untersucht  wie  maximaler  erlaubter  Skalierungsfaktor. 
Anzahl  von  Gebieten.  Auflosung  und  Offset-Werte.  minimaler  Aussteuerungsbcreich  sowie  Wiedergabetreue.  Die  Leistungs- 
fahigkeit  des  Algorithmus,  dargestellt  dutch  die  Wiedergabetreue  als  Funktion  der  (Compression,  wird  mit  einer  adaptiven 
diskreten  Kosinustransformationsmethode  zur  Bildkompression  fiber  einen  weiten  Bcreirh  von  Kompressionswerten  verglichen. 


Resume.  Cet  article  presente  des  resultats  concernant  une  methode  de  compression  d'image  basee  sur  les  transformers.  Ces 
results  Is  sont  examines  en  fonction  de  diflerents  parametres  d'encodage  induant  le  facteur  d’echelle  maximum  autorise,  le 
nombre  de  domaines,  la  precision  des  valeurs  d'echelle  et  d'offset.  la  taille  minimum  d'intervalle  et  I'objectif  de  qualite.  La 
performance  de  falgorithme.  evaluee  au  moyen  du  rapport  de  la  qualite  en  fonction  de  la  compression,  est  comparee  a  un 
schema  de  compression  base  sur  la  transformee  en  cosinus  adaptative  sur  une  large  gamme  de  debits. 


Keywords.  Image  compression :  fractals:  iterated  transformations. 


I.  Introduction 

Because  of  the  increasing  use  of  digital  imagery, 
there  is  currently  considerable  interest  in  the  image 
compression  problem.  This  interest  has  led  to  the 
establishment  by  the  Joint  Photographic  Experts 
Group  of  an  image  compression  standard  based  on 
discrete  cosine  transforms.  Although  the  use  of  this 
standard  is  becoming  common,  there  are  alterna¬ 
tive  methods  for  compressing  images.  One  such 
alternative,  iterated  transformations,  has  been  pre¬ 
sented  by  Jacquin  [6-8],  This  method  has  its  found¬ 
ation  in  the  theory  of  iterated  function  systems 
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(IFSs),  developed  by  Hutchinson  [5]  and  Barnsley 
[  1],  and  recurrent  iterated  function  systems  (RIFS) 
[2].  For  a  description  of  the  basic  iterated  trans¬ 
form  method,  refer  to  [6-8],  This  method  has  been 
extended  to  include  individual  transforms  which  are 
not  contractive  (4J.  Because  this  is  a  relatively  new 
method,  little  information  is  currently  available  on 
its  performance.  In  writing  even  a  simple  iterated 
transform  encoder,  there  are  a  host  of  parameters 
which  can  be  varied.  Currently,  there  is  no  infor¬ 
mation  available  on  the  dependence  of  system  per¬ 
formance  on  such  parameters.  The  purpose  of  this 
paper  is  to  present  results  on  the  dependence  of 
compression,  fidelity  and  encoding  time,  on  several 
pertinent  system  parameters.  For  the  purpose  of 
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defining  the  parameters  of  interest  in  this  paper,  a 
brief  summary  of  the  method  is  presented  in  the 
following  paragraphs.  A  more  detailed  description 
of  the  method  in  the  notation  used  in  this  corre¬ 
spondence,  and  a  description  of  the  basic  imple¬ 
mentation  used  here  can  be  found  in  [4]. 

/.  I.  Theoretical  background 

The  image  is  encoded  in  the  form  of  an  iterative 
system  (a  space  and  a  map  from  the  space  to  itself) 
W\  F-*  F.  The  space  F  is  a  complete  metric  space 
of  images,  and  the  mapping  W  (or  some  iterate  of 
IV)  is  a  contraction.  The  contractive  mapping  fixed 
point  theorem  ensures  convergence  to  a  fixed  point 
upon  iteration  of  W.  The  goal  is  to  construct  the 
mapping  W  with  fixed  point  ‘close’  (based  on  a 
properly  chosen  metric  S(fig))  to  a  given  image 
that  is  to  be  encoded,  and  such  that  W  can  be 
stored  compactly.  The  collage  theorem  provides 
motivation  that  a  good  mapping  can  be  found  [1]. 
Decoding  then  consists  of  iterating  the  mapping  W 
from  any  initial  image  until  the  iterates  converge 
to  the  fixed  point. 

Let  /=  [0,  1]  and  lm  be  the  m-fold  Cartesian  pro¬ 
duct  of  /  with  itself.  Let  F  be  the  space  consisting 
of  all  graphs  of  real  Lebesgue  measurable  functions 

:=f(x,y)  with  (x,  y,f(x,  y))el3.  Let  D . D„ 

and  Rt . R„  be  subsets  of  I2  and 

v . .  iv /'  -* /’  be  some  collection  of  maps. 

Define  tv,  as  the  restriction 

Wi  =  Vi\D,nf. 

The  maps  tv . .  w„  are  said  to  tile  I2  if  for  all 

feF,  U"- 1  Wi(/)eF.  This  means  the  following:  for 
any  image  feF,  each  D,  defines  a  part  of  the  image 
f  r\{D,  x  /)  to  which  w,  is  restricted.  When  tv,  is 
applied  to  this  part,  the  result  must  be  a  graph  of 
a  function  over  Rt,  and  I2=  U"-i  A-  This  is  illus¬ 
trated  in  Fig.  1.  This  means  that  the  union 
U"-t  wiif)  yields  a  graph  of  a  function  over  I2, 
and  that  the  R,' s  are  disjoint.  The  map  IF  is  defined 
as 

n 

W~\Jw„  (I) 

/-i 


Since  the  goal  is  to  limit  the  memory  required  to 
specify  W,  1 2  is  partitioned  by  geometrically  simple 
sets  Ri  with  (J,"=  i  R,  =  I2-  For  each  R,,  a  Did2  and 
tv, :D,  x  I -*  I3  is  sought  such  that  tv,<  f)  is  as  8  close 
to  fn  (Ri  x  /)  as  possible;  that  is, 

S(fn(Ri*I),  w,(f))  (2) 

is  minimized.  The  motivation  for  minimizing 
expression  (2)  is  provided  by  the  collage  theorem 

HI. 

Those  initiated  to  IFS  theory  may  find  it  surpris¬ 
ing  that  when  the  tranformations  >v,  are  con¬ 
structed,  it  is  not  necessary  to  impose  any 
contractivity  conditions  on  the  individual  trans¬ 
forms.  The  necessary  contractivity  requirement  is 
that  fV  be  eventually  contractive  [4].  A  map 
W:F-*F  is  eventually  contractive  if  there  exists  a 
positive  integer  m  such  that  the  mth  iterate  of  W 
( W0”)  is  contractive  (as  measured  by  an  appropri¬ 
ate  metric).  All  contractive  maps  are  eventually 
contractive,  but  not  vice  versa. 

A  brief  explanation  of  how  a  transformation 
W:F-*F  can  be  eventually  contractive  but  not 
contractive  is  in  order.  The  map  W  is  composed  of 
a  union  of  maps  tv,  operating  on  disjoint  parts  of 
an  image.  If  any  of  the  tv,  are  not  contractive,  then 
W  will  also  not  be  contractive.  The  iterated 
transform  is  composed  of  a  union  of  composi¬ 
tions  of  the  form 

tV, |  °  IV, 2  o  •  •  •  o  . 

Since  the  product  of  the  contractivities  bounds  the 
contractivity  of  the  compositions,  the  compositions 
may  be  contractive  if  each  contains  sufficiently 
contractive  tv,, .  Thus  W  will  be  eventually  contrac¬ 
tive  if  it  contains  sufficient  ‘mixing’  so  that  the  con¬ 
tractive  tv,  eventually  dominate  the  expansive  ones. 

1.2.  Implementation 

For  256  x  256  pixel  8  bit  per  pixel  (bpp)  images 
the  model  was  scaled  to  [0, 255]  x 
[0, 255]  x  [0, 255].  (For  other  image  sizes,  appro¬ 
priate  scaling  can  be  employed.)  To  limit  the 
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Fig.  1 .  Parts  of  the  tiling  of  an  image. 


memory  required  to  specify  tv,,  only  maps  of  the 
form 


X 

a,  bi 

0 

X 

e, 

y 

= 

c ,  di 

0 

y 

+ 

f 

~ 

O 

O 

_ 1 

Si 

2 

0, 

(3) 


are  considered,  where  w,  is  restricted  to  A  x  /.  In 
terms  of  an  image,  x  and  y  represent  the  pixel  coor¬ 
dinates,  and  2  represents  the  pixel  intensity.  The 
pixel  intensities  are  clipped  when  w,  maps  outside 
the  allowed  intensity  range.  For  the  transformation 
w,  to  be  compactly  specified,  R,  and  A  must  be 
compactly  specified.  This  can  be  done  by  choosing 
Ri  and  A  from  a  small  set  of  potential  candidates. 
Also,  many  of  the  computations  are  simplified  if 
Ri  and  A  are  geometrically  simple.  Let  R  be  the 
collection  of  subsets  of  I2  from  which  the  R,  are 
chosen,  and  let  D  be  the  collection  of  subsets  of  I2 
from  which  the  A  are  chosen.  The  set  R  was  chosen 
to  consist  of  4  x  4,  8  x  8,  16x16  and  32  x  32  non¬ 
overlapping  subsquares  of  [0, 255]  x  [0, 255]  (i.e., 
squares  of  size  jxj  have  their  upper  left  comers  at 
integer  multiples  of  2).  The  collection  D  consisted 
of  8  x  8,  16  x  16,  32  x  32  and  64  x  64  subsquares 
with  sides  parallel  to  or  slanted  at  45°  angles  from 
the  natural  edges  of  the  image.  Although  the  A’s 
and  Ri’ s  are  not  strictly  the  domains  and  ranges  of 
the  w/s,  the  terminology  will  be  used  because  it 


is  descriptive.  Domain  squares  of  size  s*s  were 
restricted  to  have  comers  on  a  lattice  with  vertical 
and  horizontal  spacing  of  s/2.  This  choice  of  D  will 
be  called  D'.  It  is  clear  then  that  the  size  and  posi¬ 
tion  of  A.  the  size,  position  and  orientation  (i.e.. 
0°  or  45°)  of  A,  and  which  one  of  the  eight  possible 
symmetries  (rotation  and  flip  operator  v,)  for  map¬ 
ping  one  square  onto  another,  uniquely  define  the 
coefficients  a,,  b,,  c,,  d,,  e,  and  f  in  (3).  In  this 
implementation,  only  domains  with  side  length 
twice  that  of  the  range  are  allowed,  resulting  in 
contraction  in  the  xy  plane.  Therefore,  each  range 
pixel  corresponds  to  a  two  by  two  pixel  area  in 
the  corresponding  domain.  The  average  of  the  four 
domain  pixel  intensities  are  mapped  to  the  area  of 
the  range  pixel  when  computing  w,{  f).  When  A  is 
oriented  at  45°,  the  averaging  of  pixels  is  more 
complicated,  the  details  of  which  are  not  very  sig¬ 
nificant.  What  is  significant  is  that  the  method  for 
averaging  pixels  in  the  encoding  and  decoding  pro¬ 
cedures  are  consistent. 

Insisting  that  wt  map  (the  graph  above)  A  to  (a 
graph  above)  /?,  while  minimizing  expression  (2) 
determines  s ,  and  ot.  In  this  way  w,  is  determined 
uniquely  for  a  chosen  metric.  In  this  paper,  the 
root  mean  square  (rms)  error  (£„„)  was  chosen  as 
the  metric.  For  further  comment  on  this  choice  of 
the  metric,  see  [4] .  Because  st  and  o,  must  be  stored 
with  a  fixed  number  of  bits  (n,  and  n„ ,  respectively), 
they  must  be  discretized.  The  values  for  s,  were 
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restricted  to  the  range  smax  ^  |s,|  >  (smax/10)  and  s,  = 
0,  where  smix  was  the  maximum  allowable  s,.  The 
distribution  of  the  discretized  s/s  was  chosen  such 
that  any  desired  scale  factor  could  be  represented 
within  some  fixed  percentage  (i.e.,  a  logarithmic 
scale).  The  minimum  and  maximum  possible 
values  for  o,  are  restricted  by  the  value  of  the  corre¬ 
sponding  Si.  Given  n„,  the  discretized  values  for  o, 
were  distributed  linearly  over  this  interval.  Only 
the  discretized  values  of  s,  and  o,  are  used  when 
calculating  the  values  of  s,  and  o,  which  minimizes 
expression  (2). 

Once  the  choice  of  R  and  D  has  been  made, 
the  encoding  problem  is  reduced  to  choosing  a  set 
{ /?,}<=/?,  and  the  corresponding  set  {D,}  <=D,  such 
that  good  compression  and  an  accurate  encoding 
of  the  image  results.  To  take  advantage  of  local 
'flatness'  in  the  image  and  to  reduce  the  error  in 
regions  of  high  variability,  a  recursive  quadtree 
partitioning  method  was  used  to  allow  the  range 
squares  to  vary  in  size. 

The  method  used  to  find  the  D/s  determines  how 
much  computation  time  the  encoding  takes.  A 
search  through  all  of  D  would  clearly  result  in  the 
choice  that  would  best  minimize  expression  (2), 
but  for  applications  for  which  encoding  time  is  a 
consideration,  such  a  search  may  require  too  much 
computation  time.  Therefore  a  classification 
scheme  was  used  to  reduce  the  amount  of  computa¬ 
tion  needed  to  find  a  good  covering.  The  strategy 
of  the  classification  scheme  is  important,  although 
the  fine  details  of  the  implementation  method  are 
not  critical.  The  following  paragraph  reviews  the 
important  aspects  of  the  classification  scheme 
used. 

The  classification  scheme  used  was  based  on  the 
simple  ideas  that  good  covers  would  have  matching 
regions  of  bright  and  dark  and  that  any  strong 
edges  (variations  in  intensity)  should  also  match. 
The  classification  scheme  in  [6]  was  generally  based 
on  these  ideas.  The  classification  scheme  began  by 
computing  the  average  intensity  for  each  quadrant 
of  the  (range  or  domain)  square.  Then  a  symmetry 
operation  was  applied  to  force  the  square  into  an 
orientation  with  its  brightest  quadrant  in  the  upper 

S*fftal  frocnamg 


left  quadrant,  and  to  put  the  second  brightest 
quadrant  into  the  upper  right  quadrant  (or  the 
third  brightest  if  the  second  brightest  could  not  be 
so  oriented).  This  process  divided  the  squares  into 
three  main  classes  (and  defined  a  symmetry  opera¬ 
tion  for  each  square).  Each  of  these  three  main 
classes  was  then  subdivided  by  determining  the 
amount  of  the  variation  of  the  average  brightness 
of  the  sub-quadrants  for  each  quadrant  of  the 
square.  The  quadrants  of  the  square  were  thereby 
ordered  from  first  to  fourth  by  the  amount  of  varia¬ 
tion  within  each  quadrant.  There  are  24  possible 
permutations  for  the  order  of  the  relative  bright¬ 
ness  variations.  This  results  in  a  total  number  of 
72  classes.  The  symmetry  operations  determined  in 
this  classification  for  a  given  R,  and  D,  defined  the 
rotation  and  flip  operation  v,  for  mapping  D,  to 
Rt.  further  increasing  the  time  saved  during 
encoding. 

The  encoding  process  proceeded  as  follows.  Ini¬ 
tially,  the  range  squares  R,  were  chosen  to  be  64 
subsquares  of  size  32  *  32.  The  first  32  *  32  range 
square  was  classified  using  the  same  classification 
procedure  as  the  domains.  A  search  was  then  per¬ 
formed  for  the  domain  square  (with  side  length 
twice  that  of  the  range  square)  in  the  same  class 
(or  similar  classes)  which  best  minimized  expres¬ 
sion  (2).  If  this  best  domain  square  and  its  corre¬ 
sponding  h-  resulted  in  an  error  (given  by 
expression  (2))  less  than  a  predetermined  tolerance 
(Cc),  h’  (and  D)  was  stored  and  the  process  was 
repeated  for  the  next  range  square.  If  the  predeter¬ 
mined  tolerance  was  not  satisfied,  the  range  square 
was  subdivided  into  four  equal  squares.  This 
quadtreeing  process  was  repeated  until  the  toler¬ 
ance  condition  was  satisfied,  or  a  range  square  of 
a  predetermined  minimum  size  rmin  was  reached. 
For  range  squares  of  size  rmi„ ,  the  best  w  was  stored 
whether  or  not  ec  was  satisfied.  The  process  was 
continued  until  the  entire  image  was  encoded.  The 
average  storage  requirement  for  a  single  w  was 
about  30  bits,  and  was  dependent  on  several  fac¬ 
tors.  The  rotation  and  flip  operator  v,  required  3 
bits,  and  for  most  encodings  presented  here  n,  and 
n„  were  stored  with  a  total  of  12  bits.  The  position 


E.  W.  Jacobs  et  al.  /  A  study  of  iterated  transforms 


255 


of  R,  was  inferred  from  the  ordering.  Approxi¬ 
mately  15  bits  were  required  to  identify  the  size  of 
/?,  and  the  location  (and  orientation)  of  D,,  this 
number  being  dependent  on  both  the  choice  of  D 
and  the  level  in  the  quadtree.  When  s,  =  0,  only  o, 
and  the  size  of  R ,  are  required,  so  the  transforma¬ 
tion  can  be  stored  more  compactly.  The  com¬ 
pressions  quoted  in  this  paper  were  computed  from 
the  actual  size  of  the  compressed  data  files. 

Clearly  this  encoding  method  has  many  param¬ 
eters  which  influence  the  compression,  accuracy 
and  speed  of  the  encoding.  How  these  parameters 
effect  the  encoding  is  not  a  priori  obvious.  The 
following  discussion  will  consider  how  varying  the 
number  of  possible  domains  effects  the  encoding 
accuracy,  compression  and  speed.  The  following 
assumes  that  one  encoding,  call  it  W\  uses  a  given 
set  of  possible  domains  D ',  while  the  other  fV"  uses 
a  domain  set  D"  such  that  D'  <=  D".  First,  it  is  clear 
that  for  any  given  range  square,  the  best  covering 
(in  the  sense  of  minimizing  expression  (2))  from 
D"  must  be  as  good  as  or  better  than  the  best  cover¬ 
ing  taken  from  D.  However,  this  does  not  guaran¬ 
tee  an  improved  encoding  within  the  quadtree 
method.  It  is  possible  that  some  large  range  which 
was  subdivided  in  W'  (and  then  covered  very  well) 
will  not  be  subdivided  in  W",  resulting  in  a  less 
accurate  encoding.  Second,  since  D"  contains  D\ 
the  individual  wf  transforms  of  W"  require  more 
bits  to  store  than  the  hv’s  of  W'.  However,  this  does 
not  mean  that  W"  must  have  a  poorer  compression 
than  W' .  If  a  large  block  is  covered,  rather  than 
subdivided,  then  only  one  transform  (instead  of  at 
least  four)  needs  to  be  stored,  resulting  in  a  net 
savings  in  the  total  number  of  required  bits. 
Finally,  although  the  construction  of  W”  requires 
searching  through  a  bigger  domain  pool  than  the 
construction  of  W' ,  it  does  not  follow  that  the 
encoding  process  must  take  more  time.  When  a 
large  block  is  covered  the  four  smaller  ranges  do 
not  need  to  be  covered,  thus  saving  time. 

This  paper  describes  the  dependence  of  the  per¬ 
formance  of  the  encoding  scheme  on  the  following 
parameters :  ( 1 )  the  number  of  bits  used  to  store  the 
scale  factor  («,)  and  offset  (n„),  (2)  the  maximum 


allowed  s,  (.w),  (3)  the  number  of  possible 
domains,  (4)  the  criterion  used  to  decide  if  a  cover¬ 
ing  is  acceptable  (ej,  (5)  the  minimum  range  size 
in  the  quadtree  subdivision  (rmin),  and  (6)  the 
number  of  domain  classes  searched  (nc)-  It  is  not 
obvious  how  adjusting  these  parameters  effects  the 
performance  of  the  encoding  method,  since  for 
each  of  these  parameters  it  is  possible  to  construct 
arguments  similar  to  that  given  previously  for  the 
total  number  of  possible  domains. 

Before  presenting  results,  a  tutorial  example  is 
given  to  better  illustrate  how  the  method  works. 
Let  values  of  z  —  0  be  represented  as  black,  z  =  1  as 
white,  with  intermediate  values  as  shades  of  gray. 
Consider  the  image  in  Fig.  2,  and  the  sixteen  trans¬ 
formations  given  in  Table  1,  where  the  first  eight 
transformations  are  restricted  to  act  on  the  region 
{(x,jy)|0$x<5,  and  the  second  eight 

transformations  are  restricted  to  act  on  the  region 
{(•*>>')!  2  i}.  The  map  W  is  defined 

as  the  union  of  these  16  h>,’s  and  encodes  the  image 
shown.  This  can  be  easily  demonstrated.  The  start¬ 
ing  point  of  the  iteration  is  arbitrarily  chosen  as 
z  =  0.5  for  {(.v,y)|0^.v^  1,  lj.The  first  six 
iterates,  and  the  fixed  point  are  shown  in  Fig.  3.  In 
practice,  the  values  of  v,  y  and  z  are  discretized. 
When  the  image  in  this  example  is  discretized  as 
128  x  128  pixels,  and  8  bits  per  pixel,  the  encoder 


(0,1) 


(0,0)  (1,0) 
Fig.  2.  A  gray  scale  image. 
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Table  1 


A  set  of  1 6  transformations  that  encode  the  image  in  Fig.  2 


a 

b 

C 

d 

S 

e 

/ 

o 

0.5 

0.0 

0.0 

0.5 

2.0 

0.75 

0.0 

0.0 

0.5 

0.0 

0.0 

0.5 

2.0 

0.75 

0.25 

0.0 

0.5 

0.0 

0.0 

0.5 

2.0 

0.50 

0.0 

0.0 

0.5 

0.0 

0.0 

0.5 

2.0 

0.50 

0.25 

0.0 

0.5 

0.0 

0.0 

0.5 

2.0 

0.0 

0.5 

0.0 

0.5 

0.0 

0.0 

0.5 

2.0 

0.0 

0.75 

0,0 

0.5 

0.0 

0.0 

0.5 

2.0 

0.25 

0.5 

0.0 

0.5 

0.0 

0.0 

0.5 

2.0 

0.25 

0.75 

0.0 

0.0 

-0.5 

0.5 

0.0 

0.25 

0.25 

-0.25 

0.0 

0.0 

-0.5 

0.5 

0.0 

0.25 

0.25 

0.0 

0.25 

0.0 

-0.5 

0.5 

0.0 

0.25 

0.5 

-0.25 

0.25 

0.0 

-0.5 

0.5 

0.0 

0.25 

0.5 

0.0 

0.0 

0.0 

-0.5 

0.5 

0.0 

0.25 

0.75 

0.25 

0.0 

0.0 

-0.5 

0.5 

0.0 

0.25 

0.75 

0.5 

0.25 

0.0 

-0.5 

0.5 

0.0 

0.25 

1.0 

0.25 

0.25 

0.0 

-0.5 

0.5 

00 

0.25 

1.0 

0.5 

0.0 

described  in  this  paper  automatically  encodes  this 
image  (using  an  equivalent  set  of  16  trans¬ 
formations)  with  the  resulting  compression  equal 
to  356:1.  Note  that  smat  =  2.0  for  this  encoding. 
The  encoder,  when  constrained  to  have  r,<l, 
requires  520  transformations  to  encode  this  image, 
with  a  resulting  compression  of  less  than  10:1. 


2.  Results 

The  peak-signal-to-noise  ratio  (PSNR)  was  used 
to  determine  image  fidelity.  PSNR  is  defined  as 

PSNR  =  -201og,„(~), 

where  rms  is  the  root  mean  square  error  of  the 
reconstructed  image  and  n  is  the  number  of  bits 
per  pixel  of  the  image. 

Figure  4  shows  PSNR  versus  compression  as  a 
function  of  ns  and  rt0 ,  the  number  of  bits  used  to 
store  values  for  s  and  o,  respectively.  This  data 
results  from  encodings  of  the  512  x  512  pixel  reso¬ 
lution,  8-bpp  image  of  Lena.  For  these  encodings, 
the  full  set  D'  as  described  in  Section  1  was  used. 
The  other  parameters  used  for  these  encodings 
were:  minimum  range  size  of  4  x  4  pixels  (rmin  =  4), 
maximum  allowed  scale  factor  set  to  1.0 
1.0),  root  mean  square  error  tolerance  (ec)  set  to 
8.0,  and  number  of  classes  searched  equal  to  4  ( nc  = 
4).  Results  for  ns  and  n„  equal  to  3,  4,  5,  6,  7  and 
8  are  shown.  The  curves  connect  the  data  points 
with  the  same  value  of  ns.  The  different  symbols 
indicate  the  value  of  n„ .  By  connecting  the  similar 


Fig.  3.  The  starting  image,  the  first  six  iterates,  and  the  fixed  point  for  the  map  composed  of  the  16  transformations  in  Table  t. 
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COMPRESSION 

Fig.  4.  PSNR  versus  compression  as  a  function  of  n,  and  n„  for 
512  x  512  Lena.  The  values  of  n„  =  3, 4,  5, 6, 7  and  8  are  denoted 
by  O,  •,  □,  ■.  A  and  A,  respectively.  The  values  of  n,  are 
noted  on  the  plot,  the  lines  connecting  points  with  common 
Other  parameters  were  O'  and  (rm,„,  ,  ec,  nj  equal  to 

(4,  1.0,  8.0, 4),  respectively. 


Fig.  6.  PSNR.  versus  compression  as  a  function  of  *,»».  The 
curve  at  lower  compression  is  for  256  x  256  'tank  farm’,  the 
curve  at  higher  compression  for  256  *  256  Lena.  The  values  of 
Jm„'0.5,  0.8,  1.0,  1.2,  1.5  and  2.0  are  denoted  by  A.  A,  ■, 
□,  •  and  O,  respectively.  Other  parameters  were  He  =  72,  with 
others  the  same  as  Fig.  S. 


symbols,  one  can  visualize  the  curves  of  constant 

n„. 

The  lack  of  contractivity  condition  on  s,  is 
reflected  in  Figs.  5  and  6  where  data  from  enco¬ 
dings  with  s™,  as  high  as  2.0  are  presented.  Figure 


(a) 


(c) 


_ lilting  «aUlll _ 

i - 1 - 1  i - 1 

44  I  U 

KALI  FACTOR 

Fig.  5.  Relative  number  of  t,'»  at  each  allowed  value  for  three 
encodings  of  256  *  256  Lena.  The  value  of  for  a,  b  and  c 
were  2.0,  1 .0  and  0.5,  respectively.  Other  parameters  were  n,  - 
5,  n. -7,  with  other  parameters  the  same  as  Fig.  4. 


S  shows  the  distribution  of  the  scale  factors  used 
in  three  different  encodings  of  256  x  256  pixel  reso¬ 
lution,  8-bpp  Lena.  The  values  of  for  these 
encodings  were  2.0,  1.0  and  0.5  in  Figs.  5(a),  5(b) 
and  5(c),  respectively.  Other  parameters  were  N,= 
5,  «o=7,  with  all  other  parameters  the  same  as  in 
Fig.  4. 

In  Fig.  6  PSNR  versus  compression  is  plotted 
for  different  values  of  .w, .  The  curve  at  lower  com¬ 
pression  in  Fig.  6  is  for  the  256  *  256  pixel  resolu¬ 
tion,  8  bit  per  pixel  image  of  ‘tank  farm’  shown  in 
Fig.  7(a).  The  decoded  image  for  smax=1.2  is 
shown  in  Fig.  7(b).  The  curve  at  higher  com¬ 
pression  is  for  256  x  256  Lena. 

Figure  8  shows  PSNR  versus  compression  as  a 
function  of  the  number  of  domains  for  512  x  512 
Lena.  Ail  the  results  in  Fig.  8  used  the  parameters 
imu  =  1  0,  rmh,  =  4,  n,  =  5,  =  7  and  ec  =  8.0.  The  set 

Dl/*,  with  number  of  domains  reduced  by 
(approximately)  a  factor  of  four  from  the  set  D\ 
was  obtained  by  restricting  the  domains  to  have 
comers  on  a  lattice  with  vertical  and  horizontal 
spacing  of  s  (as  opposed  to  s/2).  The  set  D'n ,  with 
number  of  domains  reduced  by  a  factor  of  two 
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Fig.  7.  (a)  256  x  256  8  bit/pixel  original  image  of 'tank  farm', 
(b)  Decoded  image  with  jm„  =  1.2.  For  this  image  the  com¬ 
pression  is  5.96: 1  and  the  PSNR  is  30.98  dB. 


from  the  set  D‘,  was  obtained  by  eliminating  the 
domains  with  sides  slanted  at  45°  angles  from  the 
natural  edges  of  the  image.  The  set  D1  8  was 
obtained  by  both  restricting  the  lattice  spacing  and 
eliminating  the  diagonal  domains.  As  mentioned  in 
Section  1,  the  domains  were  classified  into  72 


Fig.  8.  PSNR  versus  compression  as  a  function  of  the  number 
of  domains  for  encodings  of  512  x  512  Lena.  The  sets  O',  O'  :, 
O'  4  and  O'  *  are  denoted  by  •,  O,  ■  and  □,  respectively. 
Data  for  n,  =  1 , 4  and  1 2  appear  as  the  solid,  dashed  and  dotted 
curves,  respectively.  Other  parameters  were  sm„  =  1 .0  with  other 
parameters  the  same  as  in  Fig.  5, 

classes,  and  only  a  predetermined  number  of 
classes  (nc)  were  searched  during  encoding.  There¬ 
fore,  increasing  nc  effectively  increased  the  number 
of  possible  domains.  Figure  8  shows  data  nc  - 1 
(connected  by  solid  line),  4  (dashed)  and  12  (dot¬ 
ted)  for  each  of  the  domain  sets.  The  open  symbols 
are  for  cases  in  which  no  diagonal  domains  are 
allowed  (Dl  2  and  Dl  s),  while  the  solid  symbols 
include  diagonal  domains  (O'  and  D'  4).  The 
circles  indicate  domain  lattice  spacing  of  s/2  (/>' 
and  D'  2),  while  the  squares  indicate  lattice  spacing 
of  s(D'  4  and  D'  "). 

Finally,  Fig.  9  shows  PSNR  versus  compression 
data  of  512  *  512  Lena  for  ec  =  5.0,  8.0  and  11.0. 
The  open  symbols  represent  data  for  rmjn  =  4.  Data 
is  shown  for  nc=  1  (O),  4  (0)  and  72  (A).  The 
value  of  ec  separates  the  data  into  three  widely 
spaced  clusters,  ec=5.0  for  the  cluster  with  the 
highest  PSNR,  and  ec=  11.0  for  the  cluster  with 
the  lowest  PSNR.  The  solid  symbols  represent  the 
same  set  of  data,  but  with  rmin  =  8.  Other  param¬ 
eters  were  set  as  follows :  sm,%  ~  1 .2,  ns  =  5  and  na  = 
7.  The  original  image  of  Lena  and  the  decoded 
image  with  rmm ,  ec  and  equal  to  (8,  8,  72),  respec¬ 
tively,  are  shown  in  Fig.  10.  The  set  of  domains  D 1 
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Fig.  9.  PSNR  versus  compression  as  a  function  of  ec,  rmin  and 
nc  for  5 1 2  x  5 1 2  Lena.  Open  symbols  represent  rmi„ = 4  and  solid 
symbols  rm,„  =  8.  Data  for  ec  =  5.0,  8.0  and  1 1 .0  appear  in  clus¬ 
ters  with  increasing  compression.  Within  each  cluster,  tic  =  1 
(O).  4  (0)  and  72  (A).  ADCT  data  (□)  is  shown  for 
comparison. 


was  used  for  these  encodings.  For  the  purpose  of 
comparison,  data  obtained  from  an  adaptive 
discrete  cosine  transform  (ADCT)  method  is  also 
shown.  The  ADCT  method  used  was  similar  to 
that  described  in  [3], 


3.  Discussion 


In  order  to  compare  the  relative  merits  of  various 
sets  of  parameters  it  is  necessary  to  be  able  to 
decide  what  it  means  to  say  that  one  encoding  is 
‘better’  than  another.  For  images  encoded  using 
the  method  described  in  this  correspondence,  it  has 
been  observed  that  the  PSNR  is  a  reasonable  meas¬ 
ure  of  the  visual  quality  of  the  images  (i.e.,  given 
two  encodings,  the  one  with  the  larger  PSNR  looks 
better).  It  is  clear  that  the  best  possible  encoding 
would  have  both  maximum  compression  and  maxi¬ 
mum  fidelity.  In  practice  this  is  usually  not  pos¬ 
sible,  since  adjusting  a  parameter  to  improve  the 
compression  almost  always  results  in  a  degradation 
in  the  fidelity.  Consequently  it  is  difficult  to  com¬ 
pare  two  encodings  if  one  is  more  accurate  with 
poorer  compression. 


Fig.  10.  (a)  512  x  512  8  bit/pixel  original  image  of  Lena,  (b) 
Decoded  image  with  ec  =  8,  rmm  =  8  and  tu  =  72.  For  this  image 
the  compression  is  36.78: 1  and  the  PSNR  is  30.71  dB. 


The  following  observation  suggests  how  such  a 
comparison  can  be  made.  Varying  the  target  fidel¬ 
ity  (ec)  for  a  particular  set  of  encoding  parameters 
moves  an  encoding  along  an  empirical  barrier  by 
trading  compression  for  accuracy  in  a  roughly  lin¬ 
ear  way  (see  Fig.  9).  An  encoding  resulting  from  a 
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different  choice  of  parameters  is  better  if  it  has  a 
higher  compression  than  the  barrier  for  the  same 
fidelity  or  better  fidelity  for  the  same  compression. 
It  is  not  practical  to  compute  a  barrier  for  each 
encoding,  so  that  the  better  encoding  is  chosen 
using  an  estimate  of  the  barrier.  In  the  results  pre¬ 
sented  below,  this  is  what  is  meant  when  one  com¬ 
bination  of  parameters  is  described  as  superior  to 
another.  An  inclusion  of  the  encoding  time  further 
complicates  the  issue  and  will  be  neglected,  except 
where  specifically  indicated. 

Figure  9  illustrates  these  points.  Note  that  in  the 
three  data  points  with  rmin  -  8  and  nc  =  1 ,  increasing 
ec  resulted  in  a  marked  improvement  in  com¬ 
pression  with  a  corresponding  decrease  in  PSNR. 
The  curve  obtained  by  connecting  these  data  points 
represents  the  empirical  barrier.  By  increasing  n c  to 
4  and  72,  both  the  compression  and  the  PSNR  were 
improved,  resulting  in  an  improved  performance 
barrier.  This  improvement  was  achieved  at  the  cost 
of  increased  encoding  time.  The  set  of  data  with 
rrai„  =  4  shows  similar  behavior  with  some  notable 
differences.  The  slopes  of  the  empirical  barriers  for 
the  rmm  =  4  data  are  steeper  than  those  of  the  data 
with  rmin  =  8.  The  data  with  ec  =  1 1 .0  indicates  that 
increasing  nc  resulted  in  an  increase  in  compressi- 
son  and  a  decrease  in  PSNR.  Therefore  it  is  not 
obvious  that  increasing  improved  the  encoding. 
In  light  of  the  discussion  in  the  previous  paragraph, 
the  slope  of  the  empirical  performance  barriers  for 
the  rmm = 4  data  is  steep  enough  that  one  can  con¬ 
clude  that  increasing  nc  did  result  in  better  enco¬ 
dings  (i.e.,  the  improvement  in  compression  more 
that  compensated  for  the  decrease  in  fidelity). 

In  Fig.  4  a  grid  of  data  is  presented  with  different 
combinations  of  n ,  and  n„.  The  data  is  plotted  in 
this  way  so  that  one  can  choose  the  ‘best’  combina¬ 
tion  for  n,  and  na .  By  comparing  these  data  with 
slope  of  the  barrier  in  Fig.  9  (for  r„,i„  =  4)  one  can 
see  that  the  combination  n,  =  5,  na  =  7  is  the  best. 
Extensive  comparisons  (a  few  hundred  encodings 
of  several  images  at  various  values  of  ec  and  rmin) 
has  shown  that  plots  equivalent  to  Fig.  4  do  not 
all  yield  the  same  best  combination.  Nonetheless 
the  combination  n,  *  5,  n„ = 7  is  the  best  (compro¬ 
mise)  choice  for  the  entire  set.  Consequently,  all 
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other  results  are  given  for  encodings  using  ns  =  5, 
n„-  7. 

The  data  in  Fig.  5  show  the  relative  number  of 
each  scale  factor  used  to  encode  Lena  (encodings 
of  other  images  resulted  in  similar  data).  Concen¬ 
trating  on  Fig.  5(b)  (Smax  =  1 .0),  because  the  larger 
scale  factors  seem  to  be  preferred,  one  might 
hypothesize  that  a  distribution  of  s’s  with  more 
large  values  and  less  small  values  might  yield 
improved  encodings.  Experiments  with  both  linear 
and  inverse  logarithmic  distributions  for  allowed 
values  of  s  resulted  in  no  improvement  to  the 
results  presented  here.  In  addition,  the  minimum 
allowed  nonzero  5  can  be  changed.  Initial  data  indi¬ 
cates  that  the  values  used  were  good. 

In  Fig.  5,  note  that  for  smaller  smax  a  dispropor¬ 
tionate  number  of  s,’s  at  the  extremes  of  the 
allowed  range  of  s  are  used.  Therefore  one  might 
hypothesize  that  increasing  .w  will  result  in 
improved  encodings.  In  Fig.  6,  this  is  shown  to  be 
the  case.  The  data  in  this  figure  for  both  Lena  and 
‘tank  farm’  show  similar  results.  The  encodings 
with  JmM  =  0.5  and  0.8  yielded  the  poorest  results. 
The  encodings  improved  with  increasing  sm„  up 
to  -w  =  1 .5.  The  smax  =  2.0  case  yielded  a  result 
marginally  worse  than  the  =  1 .5  encoding. 
Results  for  a  variety  of  other  images  and  encoding 
parameters  indicate  that  s™,  =  1 .2  or  1.5  usually 
yields  the  best  PSNR  versus  compression  results. 
These  results  are  particularly  interesting  because 
they  show  that  it  is  possible  to  find  an  iterative 
system  with  a  fixed  point  which  is  closer  to  a  target 
by  allowing  some  of  the  individual  transforms  to 
be  non-contractive. 

It  is  of  interest  that  every  one  of  the  encodings 
with  5 mi*  ^  1 .0  (numerous  images  and  several 
hundred  separate  encodings)  converged  to  a  fixed 
point.  In  a  few  cases  (with  =  2.0)  a  mapping 
took  more  than  ten  iterations  to  converge,  but  in  all 
cases  with  1.2,  ten  iterations  were  sufficient. 
Since  in  practice  it  is  desirable  to  perform  only  a 
small  number  of  decoding  iterations,  smtx  =  1.0  or 
1.2  were  used  for  all  other  encodings  in  this  corre¬ 
spondence.  The  question  of  contractivity  is  impor¬ 
tant,  and  is  very  much  dependent  on  the  metric 
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which  is  used.  For  instance,  the  following  proce¬ 
dure  can  be  used  to  check  if  a  mapping 
W=  (J,  h>,  is  eventually  contractive  for  the  metric 
Ssup(f  g)  =  supu. y)et1\f(x,  y) -g{x,  y)|.  Begin  with 
an  image  /  such  that  f(x,  y)  =  1.0.  Define  w',  as  w> 
with  o,  =  0,  and  W'  =  (J,  wj .  Then 

sup  { W'"0"(/(*,  >0)} 

(x,y)el2 

will  be  the  contractivity  a  of  fV°n.  To  check  if  W 
is  eventually  contractive,  iterate  W'  until  a  <  1 . 
This  test  only  determines  eventual  contractivity 
after  an  encoding  has  been  made.  A  similar  proce¬ 
dure  for  the  rms  metric  is  not  known  to  the 
authors.  It  is  relevant  to  note  that  a  mapping  W 
which  is  contractive  for  <5sup  may  only  be  eventually 
contractive  for  <5™,,.  Unlike  the  sup  metric,  the 
condition  5/  <  1 .0  is  not  sufficient  to  ensure  contrac¬ 
tivity  for  the  rms  metric.  However,  this  condition 
is  sufficient  to  ensure  eventual  contractivity  for  the 
rms  metric. 

Unlike  the  parameters  investigated  in  Figs.  4-6 
and  9,  the  number  of  available  domains  more 
directly  effects  the  encoding  time.  This  is  important 
to  keep  in  mind  when  examining  the  data  in  Fig. 
8.  The  data  indicate  that  increasing  nc  typically 
resulted  in  a  moderate  increase  in  the  PSNR  with 
a  marked  increase  in  the  compression.  Conversely, 
including  diagonals  typically  resulted  in  little 
change  in  the  compression,  but  in  a  marked 
increase  in  the  PSNR.  Decreasing  the  lattice  spac¬ 
ing  (increasing  the  number  of  domains)  resulted 
in  increases  in  both  PSNR  and  compression.  This 
means  that  for  all  of  these  different  ways  of  increas¬ 
ing  the  number  of  domains,  the  performance 
improved.  Continuing  studies  are  underway  to 
determine  at  what  point  an  increase  in  the  number 
of  domains  results  in  decreased  performance. 

In  Table  2,  the  relative  encoding  times  for  several 
encodings  are  presented.  The  data  indicate  the  rela¬ 
tive  encoding  time  as  a  function  of  nc,ec,  rm m  and 
the  image  size.  On  an  HP-Apollo  400t  workstation, 
the  relative  time  units  used  in  Table  2  are  equal  to 
approximately  1170  epu  seconds.  This  code  has  not 
been  optimized  for  speed. 


Finally,  returning  to  Fig.  9,  data  using  the 
domain  set  Dl ,  ns  =  5,  na  =  7  and  smax  =1.2,  are  pre¬ 
sented  over  a  wide  range  of  compressions.  The 
resulting  compression  was  varied  by  the  choice  of 
ec  and  rmin.  As  in  Fig.  8,  it  is  shown  that  (at  the 
cost  of  speed)  the  encodings  can  be  improved  by 
increasing  nc.  Figure  9  and  Table  2  indicate  the 
trade  off  between  encoding  time  and  PSNR  when 
varying  nc,  and  give  an  indication  of  the  efficiency 
of  the  classification  method.  For  comparison, 
results  are  also  shown  for  an  ADCT  algorithm 
similar  to  that  described  in  [3],  The  ADCT  data 
shown  is  as  good  as  the  JPEG  standard  at  lower 
compression,  and  better  than  the  JPEG  standard 
at  higher  compression  [9],  The  PSNR  versus  com¬ 
pression  performance  of  the  current  iterated  trans¬ 
form  method  is  comparable  to,  but  not  as  good  as, 
the  ADCT  algorithm. 

It  is  interesting  to  compare  the  implementation 
presented  here  with  that  of  [6].  Among  several 
other  differences,  some  major  differences  are  that  in 
[6]  the  restriction  s*  <  1 .0  was  imposed,  a  Hausdorff 
metric  as  well  as  rms  criteria  was  used  during  enco¬ 
ding,  only  10  values  for  s  were  allowed,  and  the  set 
of  allowed  domains  was  localized  and  small  (about 
the  same  in  number  as  J)1'8).  Therefore,  relative  to 
any  of  the  encodings  presented  here,  more  trans¬ 
formations  requiring  fewer  bits  each  were  used  to 
encode  the  image.  It  is  interesting  that  this  different 
approach  yields  comparable  results,  although  the 
implementation  described  in  this  correspondence 
(last  encoding  in  Table  2)  results  in  an 
improvement  of  1 .4  dB  at  the  same  bit-rate.  In  a 
later  reference  by  the  same  author  [8],  512  x  512 
images  were  encoded  with  an  algorithm  similar  to 
that  of  [6].  In  Table  2,  the  512  x  512  encoding  with 
nc  =  1  indicates  a  1 .8  dB  improvement  in  fidelity 
at  slightly  better  compression.  (Note  that  in  [6-8] 
fidelity  is  computed  as  SNR,  not  PSNR.) 

In  the  discussion  of  Fig.  4,  it  is  specifically  noted 
that  in  choosing  ‘best’  values  for  n,  and  n„,  data 
from  several  images  were  considered.  Data  in  Fig. 
6  is  presented  for  two  images,  and  in  Fig.  7  for  one 
image.  Results  on  several  other  images  that  have 
been  encoded  show  qualitatively  similar  behavior. 


262 


E.  W.  Jacobs  et  al.  /  A  study  of  iterated  transforms 


Table  2 

Relative  time  to  encode  Lena  for  various  parameters 


•Smax 

nc 

^mtn 

ec 

Resolution 

r(rel) 

Compression  PSNR  (dB) 

1.2 

i 

4 

8 

512 

I 

15.95 

1  33.13 

1.2 

4 

4 

8 

512 

3.1 

17.04 

1  33.19 

1.2 

72 

4 

8 

512 

35.5 

17.87 

1  33.40 

1.0 

4 

4 

8 

512 

3.1 

16.74 

1  33.30 

1.2 

4 

4 

5 

512 

5.3 

10.49 

1  35.92 

1.2 

4 

4 

11 

512 

2.0 

24.62 

1  30.85 

1.2 

72 

8 

8 

512 

7.5 

36.78 

1  30.71 

1.2 

1 

4 

8 

256 

0.14 

9.09 

1  30.63 

1.2 

72 

4 

8 

256 

4.5 

9.97 

1  31.53 

1.2 

72 

4 

10 

256 

3.7 

11.85 

1  30.58 

In  reference  to  Fig.  9,  the  compression  and  PSNR 
obtained  for  a  given  set  of  encoding  parameters 
depends  on  the  image.  Note  that  the  resulting 
PSNR  for  the  image  is  not  necessarily  close  to  ec . 
Therefore  it  would  be  difficult  to  a  priori  choose 
parameters  that  will  result  in  a  target  PSNR.  An 
algorithm  which  accurately  targets  compression 
can  be  made  with  some  simple  modifications  to 
the  encoding  procedure  described  here.  (Instead  of 
quadtreeing  based  on  ec ,  the  encoding  process  can 
be  structured  such  that  quadtreeing  continues  until 
a  target  number  of  transformations  is  reached.)  In 
conclusion,  it  should  be  noted  that  because  enco¬ 
ding  an  image  is  the  interesting  problem,  decoding 
of  images  has  only  been  briefly  mentioned  in  this 
correspondence.  The  decoding  of  images  using  the 
iterated  transform  method  is  inherently  fast 
(requiring  an  iteration  which  is  computationally 
simpler  than  the  inverse  transform  required  for 
ADCT),  an  important  advantage  depending  on  the 
application.  Even  though  vector  quantization  (VQ) 
methods  have  more  in  common  with  iterated  trans¬ 
forms,  ADCT  has  been  used  for  comparison  of 
data  because  the  method  is  more  standardized.  In 
general,  VQ  methods  might  be  expected  to  encode 
and  decode  faster  than  iterated  transforms. 
Because  VQ  uses  a  fixed  code  book  and  iterated 
transforms  is  self  referential,  iterated  transforms 
might  be  expected  to  work  better  for  applications 
that  require  encoding  of  a  wide  variety  of  images. 

Current  work  involves  investigation  of  new  clas¬ 
sification  schemes  that  maintain  PSNR- 
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compression  performance  while  reducing  encoding 
time;  using  linear  combinations  of  domains; 
different  image  partitioning  methods;  and  applica¬ 
tion  to  color  images. 
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